Model Evaluation
Compare AI model performance on real SEC enforcement case predictions.
Model Performance Comparison
GPT-4o
Best
64.9%
Overall Accuracy
Resolution38.6%
Monetary53.0%
Injunction78.8%
Officer Bar89.2%
Claude Opus 4
46.8%
Overall Accuracy
Resolution38.6%
Monetary23.5%
Injunction79.8%
Officer Bar92.0%
Gemini 2.0
—
Coming Soon
Showing GPT-4o predictions on 500 evaluated cases below.
Matter
Agency
Type
Filed
Status
Score